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^ ' Abstract 

We consider the problem of rate and power allocation in a multiple-access channel. Our objective is to obtain 
O ■ rate and power allocation policies that maximize a general concave utility function of average transmission rates on 

■ the information theoretic capacity region of the multiple-access channel. Our policies does not require queue-length 

' information. We consider several different scenarios. First, we address the utility maximization problem in a non- 

fading channel to obtain the optimal operating rates, and present an iterative gradient projection algorithm that uses 
approximate projection. By exploiting the polymatroid structure of the capacity region, we show that the approximate 
^ . projection can be implemented in time polynomial in the number of users. Second, we consider resource allocation in 

a fading channel. Optimal rate and power allocation policies are presented for the case that power control is possible 
and channel statistics are available. For the case that transmission power is fixed and channel statistics are unknown, 

^ , we propose a greedy rate allocation policy and provide bounds on the performance difference of this policy and the 

'NT . 

ff^ , optimal policy in terms of channel variations and structure of the utility function. We present numerical results that 

I demonstrate superior convergence rate performance for the greedy policy compared to queue-length based policies. 

In order to reduce the computational complexity of the greedy policy, we present approximate rate allocation policies 
which track the greedy policy within a certain neighborhood that is characterized in terms of the speed of fading. 
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Index Terms 



. Multiple access, resource allocation, power control, utility maximization, fading channel, rate splitting. 

I. Introduction 

Dynamic allocation of communication resources such as bandwidth or transmission power is a central issue in 
multiple access channels in view of the time varying nature of the channel and the interference effects. Most of 
the existing literature focuses on specific communication schemes such as TDMA (time-division multiple access) 
[1], CDMA (code-division multiple access) [2], [3], and OFDM (Orthogonal Frequency Division Multiplexing) [4] 
systems. An exception is the work by Tse et al. [5], which consider the notion of throughput capacity for the fading 
channel with Channel State Information (CSI). The throughput capacity is the notion of Shannon capacity applied 
to the fading channel, where the codeword length can be arbitrarily long to average over the fading of the channel. 
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Tse et al. [5] consider allocation of rate and power to maximize a linear utility function of the transmission rates 
over the throughput region, which characterizes the points on the boundary of the throughput capacity region. 

In this paper, we consider the problem of rate and power allocation in a multiple access channel with perfect 
CSI. Contrary to the hnear case in [5], we consider maximizing a general utility function of transmission rates over 
the throughput capacity region. Such a general concave utility function allows us to capture different performance 
metrics such as fairness or delay (cf. Shenker [6], Srikant [7]). Our contributions can be summarized as follows. 

We first consider a non-fading multiple-access channel where we introduce a gradient projection algorithm for 
the problem of maximizing a concave utility function of transmission rates over the capacity region. We estabhsh 
the convergence of the method to the optimal rate allocation. Since the capacity region of the multiple-access 
channel is described by a number of constraints exponential in the number of users, the projection operation used 
in the method can be computationally expensive. To reduce the computational complexity, we introduce a new 
method that utilizes approximate projections. By exploiting the polymatroid structure of the capacity region, we 
show that the approximate projection operation can be implemented in time polynomial in number of users by using 
submodular function minimization algorithms. Moreover, we present a more efficient algorithm for the approximate 
projection problem which relies on rate-splitting [8]. This algorithm also provides the extra information that allows 
the receiver to decode the message by successive cancelation. 

Second, we consider a fading multiple access channel and study the case where channel statistics are known 
and transmission power can be controlled at the transmitters. Owing to strict convexity properties of the capacity 
region along the boundary, we show that the resource allocation problem for a general concave utility is equivalent 
to another problem with a hnear utility. Hence, the optimal resource allocation policies are obtained by applying 
the results in [5] for the linear utihty. Given a general utihty function, the conditional gradient method is used to 
obtain the corresponding hnear utihty. 

If the transmitters do not have the power control feature and channel statistics are not known, the throughput 
capacity region is a polyhedron and the strictly convexity properties of the region do not hold any more. Hence, the 
previous approach is not apphcable. In this case, we consider a greedy pohcy, which maximizes the utility function 
for any given channel state. This policy is suboptimal, however, we can bound the performance difference between 
the optimal and the greedy policies. We show that this bound is tight in the sense that it goes to zero either as the 
utihty function tends to a linear function of the rates or as the channel variations vanish. 

The greedy policy requires exact solution of a nonhnear program in each time slot, which makes it computationally 
intractable. To alleviate this problem, we present approximate rate allocation pohcies based on the gradient projection 
method with approximate projection and study its tracking capabihties when the channel conditions vary over time. 
In our algorithm, the solution is updated in every time slot in a direction to increase the utility function at that time 
slot. But, since the channel may vary between time-slots, the level of these temporal channel variations become 
critical to the performance. We explicitly quantify the impact of the speed of fading on the performance of the 
pohcy, both for the worst-case and the average speed of fading. Our results also capture the effect of the degree of 
concavity of the utility functions on the average performance. 

An important literature relevant to our work appears in the context of cross-layer design, where joint scheduhng- 
routing-flow control algorithms have been proposed and shown to achieve utility maximization for concave utihty 
functions while guaranteeing network stability (e.g. [9], [10], [11], [12]). The common idea behind these schemes 
is to use properly maintained queues to make dynamic decisions about new packet generation as well as rate 
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allocation. 

Some of these works ([10], [11]) explicitly address the fading channel conditions, and show that the associated 
poUcies can achieve rates arbitrarily close to the optimal based on a design parameter choice. However, the rate 
allocation with these schemes requires that a large optimization problem requiring global queue-length information 
be solved over a complex feasible set in every time slot. Clearly, this may not always be possible owing to 
the limitations of the available information, the processing power, or the complexity intrinsic to the feasible 
set. Requirement for queue-length information may impose much more overhead on the system than channel 
state information. On the other hand, even in the absence of fading, the interference constraints among nearby 
nodes' transmissions may make the feasible set so complex that the optimal rate allocation problem becomes 
NP-hard (see [13]). Moreover, the convergence results of queue-length based policies ([10], [11]) are asymptotic, 
and our simulation results show that such policies may suffer from poor convergence rate. In fact, duration of a 
communication session may not be sufficient for these algorithms to approach the optimal solution while suboptimal 
policies such as the greedy policy seems to have superior performance when communication time is hmited, even 
though the greedy policy does not use queue-length information. 

In the absence of fading, several works have proposed and analyzed approximate randomized and/or distributed 
rate allocation algorithms for various interference models to reduce the computational of the centralized optimization 
problem of the rate allocation policy ([14], [9], [15], [13], [16], [17]). The effect of these algorithms on the utility 
achieved is investigated in [13], [18]. However, no similar work exists for fading channel conditions, where the 
changes in the fading conditions coupled with the inability to solve the optimization problem instantaneously make 
the solution much more challenging. 

Other than the papers cited above, our work is also related to the work of Vishwanath et al. [19] which builds 
on [5] and takes a similar approach to the resource allocation problem for linear utility functions. Other works 
address different criteria for resource allocation including minimizing delay by a queue-length based approach [20], 
minimizing the weighted sum of transmission powers [21], and considering QuaUty of Service (QoS) constraints 
[22]. In contrast to this literature, we consider the utility maximization framework for general concave utiUty 
functions. 

The remainder of this paper is organized as follows: In Section n, we introduce the model and describe the 
capacity region of a fading multiple-access channel. In Section HI, we consider the utility maximization problem 
in a non-fading channel and present the gradient projection method with approximate projection. In Section IV, we 
address the resource allocation problem with power control and known channel statistics. In Section V, we consider 
the same problem without power control and knowledge of channel statistics. We present the greedy pohcy and 
approximate rate allocation policies and study their tracking behavior. Section VI provides the simulation results, 
and we give our concluding remarks in Section Vn. 

Regarding the notation, we denote by xi the i-th component of a vector x. We denote the nonnegative orthant 
by W^, i.e., = {a; G M" | a; > 0}. We write x' to denote the transpose of a vector x. We use the notation Pr(-) 
for the probability of an event in the Borel cr-algebra on M". The exact projection operation on a closed convex 
set is denoted by V, i.e., for any closed convex set X C M" and x G M", we have V{x) = argminyg^||a; — y\\, 
where 11 • 11 denotes the Euchdean norm. 
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II. System Model 

We consider M transmitters sharing the same media to communicate to a single receiver. We model the channel 
as a Gaussian multiple access channel with fiat fading effects, 

M 

Y{n) = Y,^/Hdn)Xi{n) + Z{n), (1) 

i=l 

where Xi{n) and Hi{n) are the transmitted waveform and the fading process of the i-th transmitter, respectively, 
and Z{n) is properly bandlimited Gaussian noise with variance A'^o- We assume that the fading processes of all 
transmitters are jointly stationary and ergodic, and the stationary distribution of the fading process has continuous 
density. We assume that all the transmitters and the receiver have instant access to channel state information. In 
practice, the receiver measures the channels and feeds back the channel information to the transmitters. The implicit 
assumption in this model is that the channel variations are much slower than the data rate, so that the channel can be 
measured accurately at the receiver and the amount of feedback bits is negligible compared to that of transmitting 
information. 

Definition 1: The temporal variation in fading is modeled as follows: 

\H^{n + 1) - Hi{n)\ = V^, for all n, i = 1, . . . , M, (2) 

where the V^s, are nonnegative random variables independent across time slots for each i. We assume that for each 
i, the random variables are uniformly bounded from above by v^, which we refer to as the maximum speed of 
fading. Under slow fading conditions, the distribution of V.^ is expected to be more concentrated around zero. 

Consider the non-fading case where the channel state vector is fixed. The capacity region of the Gaussian 
multiple-access channel with no power control is described as follows [23], 

Cg(P, H) = [r£R^^ ■.Y,R,<C(^Y. ^^^i' ^o) ,for all 5 C = {1, . . . , M}|, (3) 

where Pi and Ri are the i-th transmitter's power and rate, respectively. C{P, N) denotes Shannon's formula for 
the capacity of the AWGN channel given by 

C(P,iV) = ^log(l + ^) nats. (4) 

For a multiple-access channel with fading, but fixed transmission powers Pi, the throughput capacity region is 
given by averaging the instantaneous capacity regions with respect to the fading process [24], 



C,(P) = |i^eMf ■.^Ri<KH c(j]F,P„Afo) ,for all SQm\ 



(5) 



where H is a random vector with the stationary distribution of the fading process. 

A power control policy tt is a function that maps any given fading state h to the powers allocated to the 
transmitters Tv{h) = (7ri(ft,), . . . , ttm (^))- Similarly, we can define the rate allocation policy, TZ, as a function that 
maps the fading state h to the transmission rates, TZ{h). For any given power-control policy tt, the capacity region 
follows from Q as 



C/(7r) = |i? G M*^ - ^Ri c{^Hi'Ki{H),N^^ ,for all S C m\. 

^ i&s ies ^ ^ 



(6) 
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Tse et al. [5] have shown that the throughput capacity of a multiple access fading channel is given by 

C{P) = U Cfi-K), (7) 

where Q = {n : E//[7rj(i3")] < -Pi, for all i} is the set of all power control policies satisfying the average power 
constraint. Let us define the notion of boundary or dominant face for any of the capacity regions defined above. 

Definition 2: The dominant face or boundary of a capacity region, denoted by J^{-), is defined as the set of all 
M-tuples in the capacity region such that no component can be increased without decreasing others while remaining 
in the capacity region. 

III. Rate Allocation in a Non-fading Channel 

In this section, we address the problem of finding the optimal operation rates in a non-fading multiple-access 
channel. Without loss of generality, we fix the channel state vector to unity throughout this section, and denote the 
capacity region by a simpler notation Cg{P) instead of Cg{P, 1), where P > denotes the transmission power. 
Consider the following utility maximization problem for a M-user channel. 

maximize u{R) 

subject to Re Cg{P), (8) 

where Ri and Pi are i-th user rate and power, respectively. The utility function u{R) is assumed to satisfy the 
following conditions. 
Assumption 1: The following conditions hold: 

(a) The utility function u : M^^ ^ M is concave with respect to vector R. 

(b) u{R) is monotonically increasing with respect to Ri, for i = 1, . . . , M. 
Assumption 2: There exists a scalar B such that 

llfirll < B, for all g G du{R) and all R, 

where du{R) denotes the subdifferential of u at R, i.e., the set of all subgradients of u at R. 

Note that Assumption [2] is standard in the analysis of subgradient methods for non-differentiable optimization 
problems [25]. The maximization problem in ([D is a convex program and the optimal solution can be obtained 
by several optimization methods such as the gradient projection method. The gradient projection method with 
exact projection is typically used for problems where the projection operation is simple, i.e., for problems with 
simple constraint sets such as the non-negative orthant or a simplex. However, the constraint set in ([8]l is defined 
by exponentially many constraints, making the projection problem computationally intractable. To alleviate this 
problem, we use an approximate projection, which is obtained by successively projecting on violated constraints. 

Definition 3: Let X = {x W^\Ax < b}, where A has non-negative entries. Let y £ R" violate the constraint 
a'iX < hi, for i G {ii, . . . , ii}. The approximate projection of y on X, denoted by V, is given by 

where Vi^^ denotes the exact projection on the hyperplane {x G R"|a'.^a; = hi^}. 

'The vector g is a subgradient of a concave function / : 73 — > R at xq, if and only if f(x) — f{xo) < g'(x — xq) for all x € D. 
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Fig. 1. Approximate projection of it on a two-user MAC capacity region 



An example of approximate projection on a two-user multiple-access capacity region is illustrated in Figure [T] 
As shown in the figure, the result of approximate projection is not necessarily unique. In the following, when we 
write V, it refers to an approximate projection for an arbitrary order of projections on the violated hyperplanes. 
Although the approximate projection is not unique, it is pseudo-nonexpansive as claimed in the following Lemma. 

Lemma 1: The approximate projection V given by Definition |3] has the following properties: 

(i) For any y G M", 'P(y) is feasible with respect to set X, i.e., 'P(y) G X. 

(ii) V is pseudo-nonexpansive, i.e., 

Il^(y)- < ||y-y||, for all y G X (9) 
Proof: For part (i), it is straightforward to see that Vi{y) is given by (cf. [26] Sec. 2.1.1) 

a'^y - hi 
T^i[y) =y 11 — n— aj. 

Since has only non-negative entries, all components of y are decreased after projection and hence, the constraint 
i will not be violated in the subsequent projections. This shows that given an infeasible vector y G M", the 
approximate projection operation given in Definition |3] yields a feasible vector with respect to set X. 

Part (ii) can be verified by using the nonexpansiveness property of projection on a closed convex set (See 
Proposition 2.1.3 of [26]) for / times. Since y is a fixed point of Vi for all i, we have 

\\v{y)-y\\ = m,{...{vM))-rU---i'PUy 

< mA---i'PUy)))-VU...iVi,{y 

< \\y-y\\- (10) 

■ 

Here, we present the gradient projection method with approximate projection to solve the problem in dSjl. The 
k-th iteration of the gradient projection method with approximate projection is given by 

Rk+i =r(^Rk + a''g''), g''edu{R''), (11) 

where g'' is a subgradient of u at R^, and a'^ denotes the stepsize. Figure |2] demonstrates gradient projection 
iterations for a two-user multiple access channel. The following theorem provides a sufficient condition which can 
be used to establish convergence of (fTTI) to the optimal solution. 
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Fig. 2. Gradient projection metliod with approximate projection on a two-user MAC region 

Theorem 1: Let Assumptions [T] and |2] hold, and R* be an optimal solution of problem ([8]l. Also, let the sequence 
{R^} be generated by the iteration in (fTTl ). If the stepsize satisfies 



< < 



, 2{u{R*)-u{R^)) 



,A:||2 



then 



||^fe+l_^*|| ^ \\Rk 

Proof: We have 

\\R^ + aV - R*f = \\R^ - R*\? + 2a^{R^ - R*)'g'' + {a^f\\g''f. 
By concavity of n(-), we have 



(12) 
(13) 



R* - R^)'g^ >u{R*)-u{R''). 



(14) 



Hence, 



R^ + 0^0^ - R*f < WR'' -R 



* l|2 



,fcMI„fe||2 



If the stepsize satisfies ([T2l l. the above relation yields the following 

\\R^ + a^g^ - R*\\ < \\R'' - R*\\. 
Now by applying pseudo-nonexpansiveness of the approximate projection we have 

ll^fc+i -R*\\ = \\p(^Rk ^ ^fc^fc^ -R*\\< \\r'' + aV - < II-R'' - R*\ 



Theorem 2: Let Assumptions [T] and |2] hold. Also, let the sequence {-f?^} be generated by the iteration in (ITTI) . 
If the stepsize a'' satisfies ([T2l ). then {i?^} converges to an optimal solution R*. 

Proof: See Proposition 8.2.7 of [25]. ■ 

The convergence analysis for this method can be extended for different stepsize selection rules. For instance, we 
can employ diminishing stepsize, i.e.. 



a = oo, 



k=0 



or more complicated dynamic stepsize selection rules such as the path-based incremental target level algorithm 
proposed by Brannlund [27] which guarantees convergence to the optimal solution [25], and has better convergence 
rate compared to the diminishing stepsize rule. 
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A. Complexity of the Projection Problem 

Even though the approximate projection is simply obtained by successive projection on the violated constraints, 
it requires to find the violated constraints among exponentially many constraints describing the constraint set. In 
this part, we exploit the special structure of the capacity region so that each gradient projection step in (ITTI) can 
be performed in polynomial time in M. 

Definition 4: Let / : 2^ ^ R be a function defined over all subsets of M.. The function / is suhmodular if 

f{SUT) + f{SnT) < f{S) + f{T), for all 5, T G 2^. (15) 
Lemma 2: Define fc{S) : 2-^ ^ R as follows: 

fc{S) = C{Y,P^.No), for all 5 CA^. (16) 

If Pj > for all i G then fc{S) is submodular. Moreover, the inequality (fTSj) holds with equality if and 
only if 5 C T, or r C 5. 

Proof: The proof is simply by plugging the definition of /c(') in inequality ([T5] ). In particular, 



fc{S) + fc{T)-f{SuT)-fiSnT) = ilog 

= - log 
2 ^ 



{N0 + Z^esP^)iN0 + Y..eTP^) 
iNo + E^eSnTP^)iNo + E^eSUTP^) 
1 , T.{i,j)e{S\T)x{T\S) PiPj 



(No + E^eSnT P^)iNo + EieSuT Pi. 
> 0. (17) 

Since Pi > 0, the above inequality holds with equality if and only if 5 \ T = 0, or T \ 5 = 0. This condition is 
equivalent to either S or T contains the other. ■ 
Theorem 3: For any R G M^^, finding the most violated capacity constraint in Q can be written as a submodular 
function minimization (SFM) problem, that is unconstrained minimization of a submodular function over all S C A4. 
Proof: We can rewrite the capacity constraints of Cg{P) as 

fc{S) - ^ i?i > 0, for all S C M. (18) 

Thus, the most violated constraint at R corresponds to 

^* = arggmin fc{S) 

i&S 

By Lemma E] is a submodular function. Since summation of a submodular and a linear function is also 
submodular, the problem above is of the form of submodular function minimization. ■ 
It was first shown by Grotschel et al. [28] that an SFM problem can be solved in polynomial time. The are several 
fully combinatorial strongly polynomial algorithms in the literature. The best known algorithm for SFM proposed 
by Orlin [29] has running time O(M^). Note that approximate projection does not require any specific order for 
successive projections. Hence, finding the most violated constraint is not necessary for approximate projection. In 
view of this fact, a more efficient algorithm based on rate-splitting is presented in Appendix Jl to find a violated 
constraint. It is shown in Theorem [TT] that the rate-splitting-based algorithm runs in 0(M^ log M) time, where M 
is the number of users. 

Although a violated constraint can be obtained in polynomial time, it does not guarantee that the approximate 
projection can be performed in polynomial time. Because it is possible to have exponentially many constraints 
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violated at some point and hence the total running time of the projection would be exponential in M. However, 
we show that for a small enough stepsize in the gradient projection iteration (fTTl) . no more than M constraints can 
be violated at each iteration. Let us first define the notions of expansion and distance for a polyhedra. 
Definition 5: Let Q be a polyhedron described by a set of linear inequalities, i.e., 

Q = {x : Ax <h} . (19) 

Define the expansion of Q by 5, denoted by £s{Q), as the polyhedron obtained by relaxing all the constraints in 
( [T9l ). i.e., £s{Q) = {x eM."- : Ax < b + 61} , where 1 is the vector of all ones. 

Definition 6: Let X and Y be two polyhedra described by a set of linear constraints. Let £d{X) be an expansion 
of X by d as defined in Definition [S] The distance dH{X,Y) between X and Y is defined as the minimum scalar 
d such that X C £d{Y) and Y C £d{X). 

Lemma 3: Let fc be as defined in ( fT6l ). There exists a positive scalar 6 satisfying 

S<^ifc{S) + fciT)-fciSnT)-fciSuT)), for all 5, T G 2^ , SnT^S,T, (20) 

such that any point in the relaxed capacity region of an M-user multiple-access channel, £s{Cg{P)), violates no 
more than M constraints of Cg{P) defined in Q. 

Proof: Existence of a positive scalar 5 satisfying (l20l) follows directly from Lemma IH using the fact that 
neither S nor T contains the other one. 

Suppose for some R G £s{Cg{P)), there are M + 1 violated constraints of Cg{P). Since it is not possible to 
have M + 1 non-empty nested sets in 2-^ , there are at least two violated constraints corresponding to some sets 
5, T G 2^ where SnT^S,T, and 

-J2R^ < -fc{S), (21) 

ies 

-J^i?, < -fc{T). (22) 



Since R is feasible in the relaxed region. 



E 

i&snT 



Ri < fciSnT) + 6, (23) 
^ fc{SuT) + 6. (24) 



ieSUT 

Note that if S" n T = 0, reduces to < 5, which is a valid inequality. 
By summing the above inequalities we conclude 

5 > lifciS) + fc{T) - fciS n T) - fciS U T)), (25) 

which is a contradiction. ■ 
Theorem 4: Let Assumptions [U and |2] hold. Let i^i < P2 < • • • < Pm be the transmission powers. 
If the stepsize in the /c-th iteration ([TT]) satisfies 

P1P2 



a" < = log 



{N, + YJlUPd{No + Y.-LiPi)- 
then at most M constraints of the capacity region Cg{P) can be violated at each iteration step. 



(26) 
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Proof: We first show that inequaUty in ( [20b holds for the following choice of 5: 



log 



1 + 



P1P2 



iNo + Ef^3P^)iNo + EtUP^ 



M 



(27) 



In order to verify this, rewrite the right hand side of (1201 ) as 



1 



log 



log 



iN0 + E^esP^)iN0 + E 
{N0 + E^eSnTP^)iN0 + E^eSUTP^). 

^{i,j)eiS\T)xiT\S)PiPj 



1 + 

1 + 

1 + 

1 + 



{No + E^eSnTP^)iNo + E^eSuTP^) 

Pl^ 

iNo + E^^SnTP^mo + E^eSUTP^) 
P1P2 



{N0 + E^eSnTP^)iN0 + Ef^lP^ 
Pl^ 

iNo + Efi3P^)iNo + EfiiPi) 



The inequalities can be justified by using the monotonicity of the logarithm function and the fact that {S\T) x 
(T \ S) is non-empty because SnT^ S,T. 
Now, let R!^ be feasible in the capacity region, Cg{P). For every S" C A^, we have 

n-i ^ \\y W ] ' 

i&S 

< f{S) + 



i&S 



(28) 



< f{S)+6, 

where the first inequality follows from Assumption [TJb), Assumption |2l and Eq. (l26l ). The second inequality holds 
because for any unit vector d G M*^, it is true that 

^di<^\di\<^/M. (29) 

Thus, if a'' satisfies ^ then {R^ + a^g^) G £s{Cg{P)), for some 5 for which (|20l l holds. Therefore, by Lemma 
[3] the number of violated constraints does not exceed M. ■ 

In view of the fact that a violated constraint can be identified in 0(M^ log M) time (see the Algorithm in 
Appendix IT]), Theorem |4] implies that, for small enough stepsize, the approximate projection can be implemented 
in 0(M3 log M) time. 

In section |Vl we will develop algorithms that use the gradient projection method for dynamic rate allocation in 
a time varying channel. 



IV. Dynamic Rate and Power Allocation in Fading Channel with Known Channel Statistics 

In this section, we assume that the channel statistics are known. Our goal is to find feasible rate and power 
allocation policies denoted by IZ* and tt*, respectively, such that TZ*{H) G Cg[Tv*{H), H^, and tt* G Q. Moreover, 



Ef^[7^*(i^)] =R* £ argmax u{R), subject to R £ C{P), 
where n(-) is a given utility function and is assumed to be differentiable and satisfy Assumption [T] 



(30) 
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For the case of a linear utility function, i.e., u{R) = fi'R for some /i G Tse et al. [5] have shown that the 
optimal rate and power allocation policies are given by the optimal solution to a linear program, i.e., 

(TZ* (h) , TV* (h)) = argmax (fx'r — X'p) , subject to re C„(p,h), (31) 

r,p 

where h is the channel state realization, and A G R:'|_^ is a Lagrange multiplier satisfying the average power 
constraint, i.e., A is the unique solution of the following equations 

T 11^^ n\ fAT I — TTT u: fiWdhdz = Pi, (32) 

where and are, respectively, the cumulative distribution function (CDF) and the probability density function 
(PDF) of the stationary distribution of the channel state process for transmitter k. 

Exploiting the polymatroid structure of the capacity region, problem (|3T] ) can be solved by a simple greedy 
algorithm (see Lemma 3.2 of [5]). It is also shown in [5] that, for positive /i, the optimal solution, R*, to the 
problem in (l30l) is uniquely obtained. Given the distribution of channel state process, denoted by Fk and fk, we 
have 

The uniqueness of R* follows from the fact that the stationary distribution of the channel state process has a 
continuous density [5]. It is worth mentioning that (|33] ) parametrically describes the boundary of the capacity region 
which is precisely defined in Definition |2] Thus, there is a one-to-one correspondence between the boundary of 
C{P) and the positive vectors with unit norm. 

Now consider a general concave utility function satisfying Assumption [T] It is straightforward to show that R*, 
the optimal solution to (l30l ). is unique. Moreover, by Assumption [TJb) it lies on the boundary of the throughput 
region. Now suppose that R* is given by some genie. We can choose /i* = Vu{R*) and u{R) = (/i*)'i?, as 
a replacement for the nonlinear utility. By checking the optimality conditions, it can be seen that R* is also the 
optimal solution of the problem in (l30l ). i.e., 

i^*= argmax (/x*)'i^ subject to ReC{P). (34) 

Thus, we can employ the greedy rate and power allocation policies in (OTI ) for the linear utiUty function u(-), and 
achieve the optimal average rate for the nonlinear utiUty function u{-). Therefore, the problem of optimal resource 
allocation reduces to computing the vector R* . Note that the throughput capacity region is not characterized by 
a finite set of constraints, so standard optimization methods such as gradient projection or interior-point methods 
are not applicable in this case. However, the closed-form solution to maximization of a linear function on the 
throughput region is given by (1331 ). This naturally leads us to the conditional gradient method [26] to compute R* . 
The /c-th iteration of the method is given by 

h ~k 

where a is the stepsize and R is obtained as 

r!' G argmax (vu{R''y{R - R'')) , (36) 



LIDS REPORT 2787 



12 



where Vu{R^) denotes the gradient vector of u{-) at R^. Since the utiUty function is monotonically increasing 
by Assumption (Hb), the gradient vector is always positive and, hence, the unique optimal solution to the above 
sub-problem is obtained by (1331 ). in which fi is replaced by \/u{R^). By concavity of the utility function and 
convexity of the capacity region, the iteration (1351 ) will converge to the optimal solution of (l30l ) for appropriate 
stepsize selection rules such as the Armijo rule or limited maximization rule (cf. [26] pp. 220-222). 

Note that our goal is to determine rate and power allocation policies. Finding R* allows us to determine such 
policies by the greedy policy in (|3TI ) for /x* = Vu{R*). It is worth mentioning that all the computations for 
obtaining R* are performed once in the setup of the communication session. Here, the convergence rate of the 
conditional gradient method is generally not of critical importance. 

V. Dynamic Rate Allocation without Knowledge of Channel Statistics 

In this part we assume that the channel statistics are not known and that the transmission powers are fixed to P. 
In practice, this scenario occurs when the transmission power may be limited owing to environmental limitations 
such as human presence, or limitations of the hardware. 

The capacity region of the fading multiple access channel for this scenario is a polyhedron given by ([Sjl. Similarly 
to the previous case, the goal is to find an optimal rate allocation policy Tl*{-) with respect to a given utiUty function, 
which we formally define next. 

Definition 7: [Optimal Policy] The optimal rate allocation policy denoted by Tl*{-) is a mapping that satisfies 
TZ*{H) G Cg{P,H) for all H, such that 

Eh['R-*{H)]= R* G argmax u{R) 

subject to ReCaiP). (37) 
It is worth noting that the approach used to find the optimal resource allocation policies for the case with known 

channel statistics does not apply to this scenario, because Cg{P,h) is a polyhedron and hence, unlike in Section 

HVl the uniqueness of the optimal solution, R* for any positive vector does not hold anymore. 

Here we present a greedy rate allocation policy and compare its performance with the unknown optimal policy. 

The performance of a particular rate allocation poUcy is defined as the utiUty function evaluated at the average rate 

achieved by that policy. 

Definition 8: [Greedy Policy] A greedy rate allocation policy, denoted by 1Z, is given by 

1Z{H) = argmax u{R) 

subject to R&Cg{P,H) (38) 

i.e., for each channel state, the greedy policy chooses the rate vector that maximizes the utility function over the 
corresponding capacity region. 

The utility function u{R) is assumed to satisfy the following conditions. 

Assumption 3: For every 5 > 0, let Ms = |-ff : dH{Cg{P, H), Ca{P)) < S^. The following conditions hold: 
(a) There exists a scalar B{d) such that for all H G Ms, 

\u{Ri) -u{R2)\ < B{S)\\Ri - R2\\, for all Ri,\\Ri\\ > Ds,i = 1,2, 

where 

Ds= inf sup \\R\\. (39) 
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(b) There exists a scalar A{6) such that for all H G Ms, 

\u{TZ{H)) -u{R)\> A{6)\\'JZ{H) - Rf, for all ReCg{P,H). 
Assumption |3la) is a weakened version of Assumption |2j which imposes a bound on subgradients of the utility 
function. This assumption only requires bound on the subgradient in a neighborhood of the optimal solution and 
away from the origin, which is satisfied by a larger class of functions. Assumption [3lb) is a strong concavity type 
assumption. In fact, strong concavity of the utility implies Assumption Ob), but it is not necessary. The scalar A{6) 
becomes small as the utility tends to have a linear structure with level sets tangent to the dominant face of the 
capacity region. Assumption |3] holds for a large class of utility functions including the well known a-fair functions 
given by 



u{x) = \ ; (40) 

[ log(a;), a = 1, 

which do not satisfy Assumption |2] 

Note that the greedy policy is not necessarily optimal for general concave utility functions. Consider the following 
relations 

EH[u{n*iH))] < KH[u{n{H))] 

< u{EH[n{H)]) 

< u{EH[n*{H)]), (41) 

where the first and third inequality follow from the feasibility of the optimal and the greedy poUcy for any channel 
state, and the second inequality follows from Jensen's inequaUty by concavity of the utility function. 

In the case of a linear utility function we have ^i(EH■[7^*(i^)]) = E^/[n(7^*(i3"))], so equality holds throughout 
in (1411) and 1Z{-) is indeed the optimal rate allocation policy. For nonlinear utility functions, the greedy policy can 
be strictly suboptimal. 

However, the greedy policy is not arbitrarily worse than the optimal one. In view of (|4TI ). we can bound the perfor- 
mance difference, M(i?*)-u(Eff [:^(i3")] ), by bounding ^i(Ef/[7^*(i^)])-n(E^^[•^(i^)]) or u{EH['Jl*iH)])- 
Efj[u{T^* {H))~\ from above. We show that the first bound goes to zero as the channel variations become small 
and the second bound vanishes as the utility function tends to have a more linear structure. 

Before stating the main theorems, let us introduce some useful lemmas. The first lemma asserts that the optimal 
and greedy policies assign rates on the dominant face of the capacity region. 

Lemma 4: Let ti(-) satisfy Assumption [TJb). Also, let Tl*{-) and 1Z{-) be optimal and greedy rate allocation 
policies as in Definitions |7] and [8j respectively. Then, 

(a) TZiH) £ T{Cg{P, H)) , for all H. 

(b) Pr{H : n*iH) G .^(^^(P, i/)) } = 1. 

where J^(-) denotes the dominant face of a capacity region (cf. Definition |2l). 

Proof: Part (a) is direct consequence of Assumption (Hb) and Definition |2] If the optimal solution to the 
utility maximization problem is not on the dominant face, there exists a user i such that we can increase its rate 
and keep all other user's rates fixed while staying in the capacity region. Thus, we are able to increase the utility 
by Assumption [Hb), which leads to a contradiction. 
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For part (b), first note that with the same argument as above we have 

R*=EH[n*{H)] G r{Ca{P)). 
From Definition |2] and the definition of throughput capacity region in ([5]l, we have 



E 



H 



M 



AI 



H) 



E 



H 



i=l 



i=l 



(42) 



(43) 



Thus, Zfii 'KiH) = C{ Zfii HiP^, No), with probabiUty one, because C{Zfii HiPi, Nq) -^Zi T^liH) > 
0, for all H. Therefore, by definition of MAC capacity region in ([3]) we conclude TZ*{H) E !F{Cg{P, H)), with 
probability one. ■ 

The following lemma extends Chebyshev's inequality for capacity regions. It states that, with high probability, 
the time varying capacity region does not deviate much from its mean. 

Lemma 5: Let H he a random vector with the stationary distribution of the channel state process, mean H and 
covariance matrix K. Then 

2 

Pr{dH {Cg{P,H),Ca{P)) >6}<^, 

where aj^ is defined as 



^2 A 



SC{1,...,A/} 



{l + T'sH){J2log{l + T'sH) 



(44) 



(45) 



where 



No 

0, 



otherwise. 



(46) 



Proof: See Appendix HIl ■ 
The system parameter afj in Lemma [5] is proportional to channel variations, and we expect it to vanish for very 
small channel variations. The following lemma ensures that the distance between the optimal solutions of the utihty 
maximization problem over two regions is small, provided that the regions are close to each other. 

Lemma 6: Let the utility function, u : M^^ —>■ M, satisfy Assumptions [U and [3l Also, let Rl and R2 be the 
optimal solution of maximizing the utility over Ca{P) and Cg{P,H), respectively. If 



dH{Cg{P,H),Ca{P)) <S, 



then we have 



R*2\\ < 



^\A{5) 



(47) 



Proof: See Appendix Hill 

The following theorem combines the results of the above two lemmas to obtain a bound on the performance 
difference of the greedy and the optimal policy. 

Theorem 5: Let u : M*^ — > M4. satisfy Assumptions [T] and [3] Also, let TZ*{-) and be optimal and greedy 
rate allocation policies as in Definitions |7] and [8j respectively. Then for every 5 G [a'jj, 00), 

where R* = Eh\TI* {H)], and A{5) and B{5) are positive scalars defined in Assumption |3] 



2 2 
u{R*) - u{EH[n{H)\) < ^u{R*) + (1 - ^)b{5) 



(48) 
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Proof: Pick any 5 G [cj|^, oo). Define tiie event, V as 



V = : dH{Cg{P,H),Ca{P)) < 



By Lemma [21 tlie probability of this event is at least 1 — Using Jensen's inequality as in (|4TI) we can bound 
the left-hand side of (l48l) as follows 



V 



u{R*) - u{EH[n{H)]) < u{R*) -EH[u{n{H))] 



u{R*) - (1 - ^)Kh u{1l{H)) 



-Pr(V'=)Ejj u{n{H)) 



< ^u{R*) + (1 - ^) ( u{R*) - Kh HnHW] 



6^ 



< 



< 



'^uiw) + (1 - ^; 



E 



H 



u{R*) - u{n{H))\V 



H 



u{R*) + (1 



H 



E 



H 



\u{R*) - u{n{H))\ 



V 



(49) 



52 ' ' ' 52 

In the above relations, the first inequality follows from the fact that Pr(V) > 1 — and the second inequality 
holds because of the non-negativity of u{R). 

On the other hand, by incorporating Lemma |4] in Assumption [3ta) we have 

\u{R*) - u{TZ{H))\ < B{d)\\n{H) - R*\\. 

Now by Assumption [3] we can employ Lemma |6] to conclude the following from the above relation: 

\uiR*) - u{n{H))\ < i?(5)(ji + (||^)')<5i, for all H,dH{CgiP,H),Ca{P)) < 5, 



which implies 



E 



H 



\u{R*) - u{n{H))\ 



V 



(50) 



The desired result follows immediately from substituting (l50l) in ( |49l ). 



Theorem |5] provides a bound parameterized by S. For very small channel variations, an becomes small. Therefore, 
the parameter 5 can be picked small enough such that the bound in (l48l ) tends to zero. Figure|3]illustrates the behavior 
of right hand side of Eq. (1481 ) as a function of 6 for different values of an- For each value of an, the upper bound 
is minimized for a specific choice of 6, which is illustrated by a dot in Figure [3l As demonstrated in the figure, 
for smaller channel variations, a smaller gap is achieved and the parameter 6 that minimizes the bound decreases. 

The next theorem provides another bound demonstrating the impact of the structure of the utility function on the 
performance of the greedy policy. 

Theorem 6: Let Assumption [T] hold for the twice differentiable function u : M^^ M-|-. Also, let Tl*{-) and 
1Z{-) be the optimal and the greedy rate allocation poUcies, defined in Definitions |7] and [H respectively. Then for 
every e G (0, 1], 



liR*) - u{EH[n{H)]) < eu{R*) + -(1 - e)r{efn, 



(51) 
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Fig. 3. Parametric upper bound on performance difference between greedy and optimal policies as in right hand side of ( I48t for different 
channel variations, an, as a function of S 



where R* = Eff[7^*(i?)], and ft satisfies the following 

Amax( - VMO) < ^, for all t U -R*\\< r{e) 
in which denotes the Hessian of u, and r(e) is given by 



r(e) = Vm^ + 



(52) 



M 






i=l 





V = |/f : dH{Cg{P,H),Ca{P)) < 



(53) 



(54) 



By Lemma \5\ this event has probability at least 1 — e. Lemma |4] asserts that the optimal policy almost surely 
allocate rate vectors on the dominant face of Cg{P,H). Therefore, for almost all H £ V, the optimal policy 
satisfies the following 



E 



H 



1, f, , HjPj 



-log [l + HiPi 



+ 



0\H_ 



(55) 



Thus, for almost all H £ V, we have 



Therefore, 



\n*{H)-R*\ <^+Eh 



1 ( il + H,Pi){l + Zj^^H,P,) 



\n*{H)-R*\\ < Vm^ + 



M 











r(e), for almost all ii" G V. 



(56) 
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Now let us write the Taylor expansion of n(-) at R* in the direction of R, 

u{R) = u{R*) + Vu{R*y{R-R*)-^{R-R*y{-V\{0){R-R*) 

> u{R*) + Vu{R*y{R - R*) -\\\R- R*fXm..{-VMO) 

for some ^, < (57) 

In the above relation, let R = TZ* {H) for all G V. The utility function is concave, so its Hessian is negative 
definite and we can combine (l56l ) with the above relation to write 

u{n*{H)) > u{R*) + Vu{R*y{n*{H) - R*) - ^r{efn, for almost all H eV. (58) 

Taking the expectation conditioned on V, and using the fact that Tl*{H) G J^(Cg{P, H)) we have the following 

EH[u{n*{H))\V] > u{R*) - K{efn. (59) 



Hence, we conclude 



uiR*) - u(Ef^(7^ (//))) < u{R*) - E^^[^z(7^*(i^))] 

< ^z(i^*)-(l-e)E^^L(7^*(i^)) 



V 



-Pr{V')EH u{n*{H)) 



< u{R*)-{l-e){u{R*)--r{ef^) 



V 
1 
2 



= eu{R*) + \{l-e)r{efn. 

where the first inequality is verified by (|4T]) . and the third inequality follows from non-negativity of the utility 
function and the inequality in (|59l ). ■ 

Similarly to Theorem [51 Theorem |6] provides a bound parameterized by e. As the utility function tends to have 
a more linear structure, Q. tends to zero. For instance, i7 is proportional to a for a weighted sum a-fair utility 
function. Hence, we can choose e small such that the right hand side of (ISTI ) goes to zero. The behavior of this 
upper bound for different values of J7 is similar to the one plotted in Figure [3] 

In summary, the performance difference between the greedy and the optimal policy is bounded from above by 
the minimum of the bounds provided by Theorem |5] and Theorem [6] 

Even though the greedy policy can perform closely to the optimal policy, it requires solving a nonlinear program 
in each time slot. For each channel state, finding even a near-optimal solution of the problem in (1381) requires a 
large number of iterations, making the online evaluation of the greedy policy impractical. In the following section, 
we introduce an alternative rate allocation policy, which implements a single gradient projection iteration of the 
form ([TTI) per time slot. 



A. Approximate Rate Allocation Policy 

In this part, we assume that the channel state information is available at each time slot n, and the computational 
resources are limited such that a single iteration of the gradient projection method in ([TT]) can be implemented in 
each time slot. In order to simplify the notation in this part and avoid unnecessary technical details, we consider a 
stronger version of Assumption Hb). 
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Assumption 4: Let H) = argmaxj^^^Q^^^p fj-^u{R). Then there exists a positive scalar A such that 



|n(i^"^) - u{R)\ > - Rf, for all R € Cg{P, H). 



Definition 9: [Approximate Policy] Given some fixed integer A; > 1, we define tiie approximate rate allocation 
policy, TZ, as follows: 



n{H{n)) 



n{H{o)), 



R 



■t(n) ' 



where 



and 



n = 
n > 1, 



n 



n] 



(60) 



(61) 



is given by the following gradient projection iterations: 



-"-i(n) 



Pi 



t(n) 



n(H{kt{ 



n 



P 



t{n) 



t{n) 



3 



1, 



,A;-1, 



(62) 



where sf^j-^^ is a subgradient of u{-) at I^t{n)' ct"' denotes the stepsize and Pt{n) is the approximate projection on 
Cg{P,H{kt{n))). 

For k = 1, (l62l ) reduces to taking only one gradient projection iteration at each time slot. For k > 1, the proposed 
rate allocation policy essentially allows the channel state to change for a block of k consecutive time slots, and 
then takes k iterations of the gradient projection method with the approximate projection. We will show below that 
this method tracks the greedy policy closely. Hence, this yields an efficient method that on average requires only 
one iteration step per time slot. Note that to compute the policy at time slot n, we are using the channel state 
information at time slots kt,k(t — 1), . . .. Hence, in practice the channel measurements need to be done only every 
k time slots. 

There is a tradeoff in choosing system parameter k, because taking only one gradient projection step may not 
be sufficient to get close enough to the greedy poUcy's operating point. Moreover, for large k the new operating 
point of the greedy policy can be far from the previous one, and k iterations may be insufficient. 

Before stating the main result, let us introduce some useful lemmas. In the following lemma, we translate the 
model in Definition \T\ for temporal variations in channel state into changes in the corresponding capacity regions. 

Lemma 7: Let { [-f^i(n)]i=i^..._M} be the fading process that satisfies condition in We have 

dH{Cg{P,H{n + l)),Cg{P,H{n))) <Wn, (63) 

where are non-negative independent identically distributed random variables bounded from above hy w = 

\ YldLi ^^Pi' where is a uniform upper bound on the sequence of random variables {V^} and Pi is the i-th 
user's transmission power. 
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Proof: By Definition [6] we have 

dH [Cg {P, H{n + 1)) , Cg (P, H{n 



1 

max — 

SCIM 2 



log 1 + 



< max 



E^es\H^in + l)-H,in)\P 



SCM 2{l + Y.^esHi{n)Pi, 



^ M ^ M 



(64) 



i=l 



i=l 



Therefore, ^ is true for Wn = \ Ef=i KPi- 

Since the random variables are i.i.d. and bounded above by v\, 



the random variables Wn are i.i.d. and bounded from above by ^ "YldLi v^Pi- 



The following useful lemma by Nedic and Bertsekas [30] addresses the convergence rate of the gradient projection 
method with constant stepsize. 

Lemma 8: Let rate allocation policies 1Z and TZ be given by Definition [8] and Definition |9l respectively. Also, let 
Assumptions [I] |2] and |4] hold and the stepsize a" be fixed to some positive constant a. Then for a positive scalar 
e we have 

n(n{H{n))) > u(n{H{kt))) - (65) 

if k satisfies 



k > 



\Rt -n{H{kt))\ 



ae 



(66) 



Proof: See Proposition 2.3 of [30]. 

We next state our main result, which shows that the approximate rate allocation policy given by Definition |9] 
tracks the greedy policy within a neighborhood which is quantified as a function of the maximum speed of fading, 
the parameters of the utility function, and the transmission powers. 

Theorem 7: Let Assumptions [TJ |2] and |4] hold and the rate allocation policies TZ and TZ be given by Definition [8] 
and Definition |9j respectively. Choose the system parameters k and a for the approximate policy in Definition |9] as 



k 



,2B_ 
'Aw' 



a 



16A\3 2 

^1 -'^ 



where w' = w ^ {w 2 -\- (^^) ^ w i?, the upper bound on Wn as defined in Lemma |7j A and B are constants given 
in Assumptions |4] and [2] Then, we have 



Proof: First, we show that 



\n{H{n)) -1Z{H{n))\\ < 29 = 2(^^Yw'"^. 



\n{H{n)) - 1l{H{kt)) \\<9= 



where t = L^V^J- The proof is by induction on t. For t = the claim is trivially true. Now suppose that 
true for some positive t. Hence, it also holds for n = k{t + 1) by induction hypothesis, i.e., 

\\Fil^^-n{H{kt))\\ < e. 

On the other hand, by Lemma |7] implies that for every n, 

dH{Cg{P,H{n + l)),Cg{P,H{n))) <w. 



(67) 



(68) 
is 



(69) 
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Thus, by Lemma [6] and the triangle inequaUty we have 

\\1Z{H{k{t + 1))) - n{H{kt)) II < kw' < 9. 
Therefore, by another triangle inequality we conclude from ( [69l ) and dTOl ) that 



||^°+i - n{H{k{t + 1))) II < 29. 

After plugging the corresponding values of a and 9, it is straightforward to show that 
Thus, we can apply Lemma [8] to show 



u{n(H{n)) ) - u(n(H{k{t + 1)) 



By Assumption |4] we can write 

\\n{H{n)) - n{H{k{t + 1))) II < 

Therefore, the proof of (1681 ) is complete by induction. 
Again by applying Lemma |6] and Lemma |7] we have 



aB 



2x i 



\n{H{n)) -n{H{kt))\\ < kw' < 



holds for e 



(70) 
(71) 

(72) 
(73) 



(74) 



and the desired result directly follows from (|68] ) and (1741 ) by the triangle inequality. ■ 
Theorem |7] provides a bound on the size of the tracking neighborhood as a function of the maximum speed 
of fading, denoted by w, which may be too conservative. It is of interest to provide a rate allocation policy and 
a bound on the size of its tracking neighborhood as a function of the average speed of fading. The next section 
addresses this issue. 



B. Improved Approximate Rate Allocation Policy 

In this section, we design an efficient rate allocation policy that tracks the greedy policy within a neighborhood 
characterized by the average speed of fading which is typically much smaller than the maximum speed of fading. 
We consider policies which can implement one gradient projection iteration per time slot. 

Unlike the approximate policy given by (l60l ) which uses the channel state information once in every k time 
slots, we present an algorithm which uses the channel state information in all time slots. Roughly speaking, this 
method takes a fixed number of gradient projection iterations only after the change in the channel state has reached 
a certain threshold. 

Definition 10: [Improved Approximate Policy] Let be the sequence of non-negative random variables as 

defined in Lemma |7J and 7 be a positive constant. Define the sequence {Tj} as 

To = 0, 

t-i 

(75) 



n=T, 

Define the improved approximate rate allocation poUcy, TZ, with parameters 7 and k, as follows: 



^, , . { n(H(0)), n = 

n{H{n)) ^{ \ (76) 



Rt(n)^ n > 1, 
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— 1 — h- 


H 


1 — 1 — 1 — 1 — ^ 


— 1 — 1 — 1 — 1 — 1 — 1 — 



To = Tt{n) n '7<(„)+i 



Fig. 4. The improved approximate policy takes k gradient projection iterations at time Ttj^j, whicli is the time that the random walk 
generated by the random variables Wn reach the threshold 7. 



where 



t{n) = max{z | Tj < n}, (77) 
r = ar^max ui^F^^^^^, (78) 



and -Rjj-^) G is given by the following gradient projection iterations 



J^t(n) — Pt(n) 
f^t{n) = Pt(n) 



1,...,A:-1, (79) 



't(n)\ ' 

where g^^-^j is a subgradient of u{-) at denotes the stepsize and Pt{n) is the approximate projection on 

C,(P,i/(T,(„))). 

Figure m depicts a particular realization of the random walk generated by Wn, and the operation of the improved 
approximate policy. 

Theorem 8: Let t{n) be as defined in dTT] ). and let w = E[W„]. If A; = then we have 

lim —r-TT = 1; with probability 1. (80) 

n-^oo t(n)k 

Proof: The sequence {Tj} is obtained as the random walk generated by the Wn crosses the threshold level 7. 

Since the random variables Wn are positive, we can think of the threshold crossing as a renewal process, denoted 

by N{-), with inter-arrivals Wn- 

We can rewrite the limit as follows 

n-N{t{n)^)+N{t{n)^) n-N{t{n)j) _N{t{nh) 

lim ^ ^ -= lim -+w — — -. (81) 

t{n)k t{n)k t{n)^ 

Since the random walk will hit the threshold with probability 1, the first term goes to zero with probability 1. 
Also, by Strong law for renewal processes the second terms goes to 1 with probability 1 (see [31], p. 60). ■ 

Theorem [8] essentially guarantees that the number of gradient projection iterations is the same as the number of 
channel measurements in the long run with probability 1. 

Theorem 9: Let Assumptions [TJ |2] and |4] hold and the rate allocation policies 1Z and TZ be given by Definition [8] 
and Definition [TOl respectively. Also, let = [^J, and fix the stepsize to a = in ( 1791 ). where 7 = c{^)iivi, 
and c > 1 is a constant satisfying the following equation 

(c^ - 1)8 
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Then 



\n{H{n)) - n{H{n)) || < 27 + /"^V 



Proof: We follow the line of proof of Theorem |7J First, by induction on t we show that 

\\TZ{H{n))-n{H{Tt))\\<j, 
where t is defined in (TTTl) . The base is trivial. Similar to ( [69l ). by induction hypothesis we have 

\\R',^,-n{H{Tt))\\<^. 

By definition of Tj in (1751 ) we can write 

dH (P, //m+l)) , Cg (P, H{Tt)') j < 7. 

Thus, by Lemma [6l we have 

(Ti+i)) - ^(/f (Ti)) II < 7^ (7^ + (|) 



Therefore, by combining (1851) and (1871) by triangle inequality we obtain 



p?+i-^(/f(ri+i))||<27 + 



V .4 y 

Using the fact that w < w = ^ ^sg^ » ^fter ^ few steps of straightforward manipulations we can show that 



lA+i 



^(/f(r,+,))f <(27+(^)') <c' 



A 



Now by plugging the values of a and 7 in terms of system parameters in (I66b . we can verify that 



> 



^0 



R^^,-n{H{Tt+,))\ 



Hence, we can apply Lemma [8] for e = A'y^, and conclude 



u 



[n{H{n)) 
By exploiting Assumption |4] we have 



n{H{Tt+i)) 



\\n{H{n))-n{H{Tt+i))\\ < 



aB 



2v i 



7- 



(83) 



(84) 



(85) 



(86) 



(87) 



(88) 



(89) 



(90) 



(91) 



(92) 



Therefore, the proof of ( [841 ) is complete by induction. Similarly to ( [87] ) we have 

\\1Z{H{n)) - n{H{Tt)) II < 7^ (7' + (f )') ' (93) 

and ([83]) follows immediately from ([Mb and ( [93] ) by invoking triangle inequality. ■ 
Theorem [8] and Theorem [9] guarantee that the presented rate allocation policy tracks the greedy policy within a 
small neighborhood while only one gradient projection iteration is computed per time slot, with probability 1 . The 
neighborhood is characterized in terms of the average behavior of temporal channel variations and vanishes as the 
fading speed decreases. 
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Fig. 5. Structure of the i-th transmitter and the receiver for the queue-length-based policy [10]. 



H. (n) 



Transmitter 



R.(n) 



Receiver 



Fig. 6. Structure of the i-th transmitter and the receiver for the presented policies. 



VI. Simulation Results and Discussion 

In this section, we provide simulation results to complement our analytical results and make a comparison with 
other fair resource allocation algorithms. We focus on the case with no power control or knowledge of channel 
statistics. We also make reasonable assumption that the channel state processes are generated by independent 
identical finite state Markov chains. We consider weighted a-fair function as the utility function, i.e., 

u{R) = Y,Wifa{Ri). (94) 

i 

where /a(-) is given by equation ( |40l ). 

We consider two different scenarios to compare the performance of the greedy policy with the queue-based rate 
allocation pohcy by Eryiknaz and Srikant [10]. This pohcy, parameterized by some parameter K, uses queue length 
information to allocate the rates arbitrarily close to the optimal policy by choosing K large enough. As illustrated 
in Figure |6l Xi (n) denotes the queue-length of the i-th user. At time slot n, the scheduler chooses the service rate 
vector /x(n) based on a max-weight policy, i.e., 

M 

fx{n) = argmax Xi{n)Ri 

i=l 

subject to R£Cg{P,H{n)) (95) 

The congestion controller proposed in [10] leads to a fair allocation of the rates for a given a-fair utility function. 
In particular, the data generation rate for the i-th transmitter, denoted by aj(n) is a random variable satisfying the 
following conditions: 

E[ai{n) \xi{n)] = min | ^ °,^|, 

E[afin) \xi{n)] < U < oo, for all Xi(n), (96) 
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Fig. 7. Performance comparison of greedy and queue-based policies for a communication session with limited duration, for ^ = 1.22. 




Fig. 8. Performance comparison of greedy and queue-based policies for a communication session with limited duration, for ^ = 0.13. 

where a, D and U are positive constants. 

In the first scenario, we compare the average achieved rate of the poUcies for a communication session with 
limited duration. Figure |7] depicts the distance between empirical average rate achieved by the greedy or the queue- 
length based policy, and R*, the maximizer of the utility function over the throughput region. In this case, the 
utility function is given by (l94l ) with a = 2 and wi = l.5w2 = 1.5, and the corresponding optimal solution 
is R* = (0.60,0.49). As observed in Figure |71 the greedy poUcy outperforms the queue-length based policy a 
communication session with limited duration. It is worth noting that there is a tradeoff in choosing the parameter 
K of the queue-length based policy. In order to guarantee achieving close to optimal rates by queue-based policy, 
the parameter K should be chosen large which results in large expected queue length and lower convergence rate. 
On the other hand, if K takes a small value to improve the convergence rate, the achieved rate of the queue based 
policy converges to a larger neighborhood of the R*. 

As established in Theorem|5l the performance of the greedy policy improves by decreasing the channel variations. 
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Fig. 9. Performance comparison of greedy and queue-based policies for file upload scenario with respect to file size f ~ fi ~ f2- Rg and 
Rq are expected upload rate of the greedy and the queue-length based policy, respectively. 

Figure [8] demonstrates the improvement in performance of the greedy poUcy when ^ decreases from 1.22 to 0.13. 
We also observe in Figure [8] that the queue-length based policy is not sensitive to channel variations, and its 
performance does not improve by decreasing the channel variations. It is worth mentioning that the greedy policy 
as observed in the simulation results performs significantly better than the bounds provided by Theorems [5] and [6] 
These upper bounds characterize the behavior of the greedy policy in terms of channel variations and structure of 
the utility function, but they are not necessarily tight. 

Second, we consider a file upload scenario where each user transmitting a file with finite size to the base station 
in a rateless manner. Let % be the i-th user's completion time of the file upload session for a file of size /j. Define 
the average upload rate for the i-th user as ^. We can measure the performance of each policy for this scenario 
by evaluating the utility function at the average upload rate. Figure |9] demonstrates the utility difference of the 
greedy and the queue-based policy for different file sizes. We can observe that for small file sizes the greedy policy 
outperforms the queue-based policy significantly, and this difference decreases by increasing the file size. We can 
interpret this behavior as follows. The files are first buffered into the queues based on the queue lengths and the 
weighted a-fair utility, while the queues are emptied by a max-weight scheduler. Once the files are all buffered in 
the queues, the queues are empties with the same rate which is not fair because it does not give any priority to the 
users based on their utility. For larger file size, the duration for which the entire file is emptied into the queue is 
negligible compared to the total transmission time, and the average upload rate converges to a near-optimal rate. 

VII. Conclusion 

We addressed the problem of optimal resource allocation in a fading multiple access channel from an information 
theoretic point of view. We formulated the problem as a utility maximization problem for a more general class of 
utility functions. 

We considered several different scenarios. First, we considered the problem of optimal rate allocation in a non- 
fading channel. We presented the notion of approximate projection for the gradient projection method to solve the 
rate allocation problem in polynomial time in the number of users. 
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Second, we studied rate and power allocation in a fading channel with known channel statistics. In this case, 
the optimal rate and power allocation policies are obtained by greedily maximizing a properly defined linear utility 
function. If for the fading channel power control and channel statistics are not available, the greedy policy is not 
optimal for nonlinear utility functions. However, we showed that its performance in terms of the utility is not 
arbitrarily worse compared to the optimal policy, by bounding their performance difference. The provided bound 
tends to zero as the channel variations become small or the utility function behaves more linearly. 

The greedy policy may itself be computationally expensive. A computationally efficient algorithm can be em- 
ployed to allocate rates close to the ones allocated by the greedy policy. Two different rate allocation policies are 
presented which only take one iteration of the gradient projection method with approximate projection at each time 
slot. It is shown that these policies track the greedy policy within a neighborhood which is characterized by average 
speed of fading as well as fading speed in the worst case. 

Appendix I 

Algorithm for finding a violated constraint 

In this section, we present an alternative algorithm based on rate-splitting idea to identify a violated constraint for 
an infeasible point. For a feasible point, the algorithm provides information for decoding by successive cancellation. 
We first introduce some definitions. 

Definition 11: The quadruple {M,P,R,Nq) is called a configuration for an M-user multiple-access channel, 
where R = {Ri, . . . ,Rm) is the rate tuple, P = {Pi, . . . , Pm) represents the received power and A'^o is the noise 
variance. For any given configuration, the elevation, 6 G M*^, is defined as the unique vector satisfying 

Ri = C{P^,No + S^), i = l,...,M. (97) 
Intuitively, we can think of message i as rectangles of height Pi, raised above the noise level by 6i. In fact, 8i is 
the amount of additional Gaussian interference that message i can tolerate. Note that if the rate vector corresponding 
to a configuration is feasible its elevation vector is non-negative. However, that is not sufficient for feasibility check. 
Definition 12: The configuration (M, P, R, Nq) is single-user codable, if after possible re-indexing, 

6i+l>6i + P^,i = 0,l,...,M-l, (98) 

where we have defined 6o = Pq = for convention. 

By the graphical representation described earlier, a configuration is single-user codable if the none of the messages 
are overlapping. Figure [TOl a) gives an example of graphical representing for a message with power Pi and elevation 
6i. Figures [TOlb) and [TOtc) illustrate overlapping and non-overlapping configurations, respectively. 

Definition 13: The quadruple {m,p,r, Nq) is a spin-off of {M,P,R,Nq) if there exists a surjective mapping 
: {1, . . . , m} ^ {1, . . . , M} such that for alH G {1, . . . , M} we have 

R. < Yl ^i- 

where is the set of all j G {1, ... , m} that map into i by means of (j). 
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Definition 14: A hyper-user with power P, rate R, is obtained by merging d actual users with powers (Pj^ , • • • , Pi^ ) 
and rates {Ri^,. ■■ ,Ri^), i.e, 

d d 

P = Y.P^,, R = Y.Ri.. (99) 

k=l k=l 

Theorem 10: For any M-user achievable configuration (M, P, R, Nq), there exists a spin-off (m, p, r, Nq) which 
is single user codable. 

Proof: See Theorem 1 of [8]. ■ 
Here, we give a brief sketch of the proof to give intuition about the algorithm. The proof is by induction on M. For a 
given configuration, if none of the messages are overlapping then the spin-off is trivially equal to the configuration. 
Otherwise, merge the two overlapping users into a hyper-user of rate and power equal the sum rate and sum power 
of the overlapping users, respectively. Now the problem is reduced to rate splitting for (M — 1) users. This proof 
suggests a recursive algorithm for rate-splitting that gives the actual spin-off for a given configuration. 

It follows directly from the proof of Proposition \T0\ that this recursive algorithm gives a single-user codable 
spin-off for an achievable configuration. If the configuration is not achievable, then the algorithm encounters a 
hyper-user with negative elevation. At this point the algorithm terminates. Suppose that hyper-user has rate R and 
power P. Negative elevation is equivalent to the following 

R>C{P,No). 

Hence, by Definition [14] we have, 

ies ies 

where S = {ii,. . . ,id} C M. Therefore, a hyper-user with negative elevation leads us to a violated constraint in 
the initial configuration. 

Theorem 11: The presented algorithm runs in 0{M^ logM) time, where M is the number of users. 

Proof: The computational complexity of the algorithm can be computed as follows. The algorithm terminates 
after at most M recursions. At each recursion, all the elevations corresponding to a configuration with at most M 
hyper-users are computed in 0{M) time. It takes 0{M log M) time to sort the elevation in an increasing order. 
Once the users are sorted by their elevation, a hyper-user with negative elevation could be found in 0(1) time, 
or two if such a hyper-user does not exists it takes 0{M) time to find two overlapping hyper-users. In the case 
that there are no overlapping users and all the elevations are non-negative the input configuration is achievable, 
and the algorithm terminates with no violated constraint. Hence, computational complexity of each recursion is 
0(M) + 0(M log M) + 0{M) = 0{M log M). Therefore, the algorithm runs in 0{M'^ log M) time. ■ 

Appendix II 
Proof of Lemma [5] 

First, consider the following lemmas. Lemma |9] bounds Jensen's difference of a random variable for a concave 
function. The upper bound is characterized in terms of the variance of the random variable. 

Lemma 9: Let / : M ^ be concave and twice differentiable. Let X be a random variable with variance cj\. 
Then, 

/(E[X]) -E[/(X)] < ^2MaynX]) " (100) 
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Fig. 10. Graphical representation of messages over multi-access channel [8]. 



where M be an upper-bound on |/"(x)|. 

Proof: Pick any < e < 1. By Chebyshev's inequality we have 



where c = Therefore, we have 



E[/(X)] = E 
-I- E 



Fr{\X -E{X)\ > c) < e, 

f{X) \X - E{X)\ < cl Pr(|X - E(X)| < c) 

f{X) \X - E{X)\ > cj Pr(|X - E(X)| > c) 

> {l-e)E\f{X) \X -E{X)\ <c 
1 - e 



(101) 



> 



f{E[X]+c)+f{E[X]-c) 



(1 - e)/(E[X]) + —-c\f"{Ci) + 



(102) 



where the first inequality follows from non-negativity of /, and the second and the second inequality follows from 
concavity of /. The scalars G [E[X],E[X] + c] and ^2 G [IE[X] - c,K[X]] are given by Taylor's theorem. 
Given the above relation, for any e > we have 



/(E[X]) - K[f{X)] < -—alM + 6/(E[X]) 



The right-hand side is minimized for 



mm 



2e 



(t\M 



^ ,2/(IE[^]), 

By substituting e* in (11031 ). the desired result follows immediately. 



(103) 



(104) 



We next provide an upper bound on variance of y = log(l + X) proportional to the variance of X. 
Lemma 10: Let X > be a random variable with mean X and variance cj^, and Y = log(l + X) then variance 
of Y is upper-bounded as 

4<a|(l+ (l + X)(^21og(l + X)-^) j. (105) 
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Proof: Let E(y) = log(l + X) for some X < X. By invoking the mean value theorem, we have 

\ 2" 



4 



= E 

= E 
< E 



log(l + X)- log(l + X) j 



i + x 
{x-xf 



(106) 



where X is a non-negative random variable. 

On the other hand, by employing lemma |9] with f{x) = log(l + x), we can write 



E [ log(l + X)] > log(l + X) - J2a\ log(l + X) + 



2 ' 



(107) 



Hence, 



X > X = exp {E[log(l + X)]} - 1 
> exp I log(l + X) - 



2ailog(l + X) + ^ 



> X-ax(l+X)(V21og(l + X 



(108) 



where the first inequality is by (11071 ). and the second relation can be verified after some straightforward manipulation. 
By combining (11061 ) and (11071 ) the variance of Y can be bounded as follows 

< E[{X-Xf] 

< E 



X-X + ax{l + X)(V21og(l + X) - 



(l + X)(V21og(l + X)-^] 



(109) 



Now we provide the proof for Lemma [5] Define random variable Ys as the following: 

Ys = llog{l + Y, 



for all = {1,...,M}. 



ies 



No 



(110) 



The facet defining constraints of Cg{P, H) and Ca{P) are of the form of X^jgg Ri < Ys and X^jg^ Ri < E[y5'], 
respectively. Therefore, by DefinitionH we have dn {Cg{P, H),Ca{P)) < 5 if and only if \Ys - E[Ys]\ < 6, for 
all S C A4. Thus, we can write 



{dHiCgiP,H),CaiP))>6} = Fr{max\Ys-E[Ys]\>6} 

< Y.^''{\^s-nys]\>s] 



SCM 

< ^E^.- (Ill) 

SCM 
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where the first inequaUty is obtained by union bound, and the second relation is by applying Chebyshev's inequality. 
On the other hand, cjy can be bounded from above by employing Lemma [TOl i.e.. 



{l + Zs){\l2\og{l + Zs)-^) 



(112) 



where 

Hi Pi 



No 

i&s ^ ies 



Hi Pi 



4.=var(j;^)= r,T,cov{H,,Hj)=T'sKTs. 

i&S ° {i,j)eS^ 

The desired result is concluded by substituting Zs and cj| in (II 12b and combing the result with (111 lb . 



Appendix III 
Proof of Lemma [6] 

Let us first state and prove a useful lemma which asserts that Euclidean expansion of a capacity region by 5 
contains its expansion by relaxing its constraints by 6. 

Lemma 11: Let Ci be a capacity region with polymatroid structure, i.e., 

Ci = ji?G M^^ : < for all S C mV (113) 

where f{S) is a nondecreasing submodular function. Also, let C2 be an expansion of Ci by 5 as defined in 
Definition |5] Then, for all R G C2, there exists some R! G Ci such that \\R — R!\\ < 6. 

Proof: By Definition \T5\ it is straightforward to show that C2 is also a polymatroid, i.e., 

C2 = lRe ■YRt< 9{S) = f{S) + 6, for aWS Cm\, (114) 

where g{S) is a submodular function. By convexity of C2, we just need to prove the claim for the vertices of C2. 
Let R G M*^ be a vertex of C2. The polymatroid structure of C2 implies that R is generated by an ordered subset 
of M (see Theorem 2.1 of [32]). Hence, there is some k £ M such that Rk = f{{k}) + 6. Consider the following 
construction for R': 

, \ Ri — 5, i = k 

i?:= ^ . (115) 

[ Ri, otherwise. 

By construction, R! is in a 5-neighborhood of R. So we just need to show that R' is feasible in Ci. First, let 
us consider the sets S that contain k. We have 

5^i2: = 5^i2,-J</(5). (116) 

ies i&s 

Second, consider the case that k ^ S. 

idS ie5u{fc} 

< f{S\J{k]) + 5-Rk 

< f{S) + f{{k}) + 6-Rk 
= f{S), 
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where the first inequaUty comes from dl 161 ). and the second inequaUty is true by submodularity of the function 
/(•). This completes the proof. ■ 
Proof: [of Lemma |6l Without loss of generality assume that u{R2) > u{R[). By Lemma [TTl there exists 
some R S Ca{P) such that ||i?2 — R\\ < ^- Moreover, we can always choose R to be on the boundary so that 
||-R|| > Ds, where Ds is defined in ( [39l ). Therefore, by Assumption Oa) and the fact that u(f?2) ^ ^(-^i) ^ u{R), 
we have 

u{R2) - u{R) = \u{R*2) - u{R)\ < B\\R*2 - R\\ < B5. (117) 

Now suppose that \\R\ — R\\ > {^5)^. By Assumption Ob) we can write 

u{R\) - u{R) = \u{R\) - u{R)\ > A\\Rl - Rf > B6. (118) 

By subtracting dl 171 ) from dllSI ) we obtain u{R2) < u{Ri) which is a contradiction. Therefore, — < (^(5)i, 
and the desired result follows immediately by invoking the triangle inequality. ■ 
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